Applicability Domains for Classification Problems: Benchmarking of Distance to Models for Ames Mutagenicity Set

نویسندگان

  • Iurii Sushko
  • Sergii Novotarskyi
  • Robert Körner
  • Anil Kumar Pandey
  • Artem Cherkasov
  • Jiazhong Li
  • Paola Gramatica
  • Katja Hansen
  • Timon Schroeter
  • Klaus-Robert Müller
  • Lili Xi
  • Huanxiang Liu
  • Xiaojun Yao
  • Tomas Öberg
  • Farhad Hormozdiari
  • Phuong Dao
  • Süleyman Cenk Sahinalp
  • Roberto Todeschini
  • Pavel G. Polishchuk
  • Anatoly G. Artemenko
  • Victor Kuzmin
  • Todd Martin
  • Douglas M. Young
  • Denis Fourches
  • Eugene N. Muratov
  • Alexander Tropsha
  • Igor I. Baskin
  • Dragos Horvath
  • Gilles Marcou
  • Christophe Muller
  • Alexandre Varnek
  • Volodymyr V. Prokopenko
  • Igor V. Tetko
چکیده

The estimation of accuracy and applicability of QSAR and QSPR models for biological and physicochemical properties represents a critical problem. The developed parameter of "distance to model" (DM) is defined as a metric of similarity between the training and test set compounds that have been subjected to QSAR/QSPR modeling. In our previous work, we demonstrated the utility and optimal performance of DM metrics that have been based on the standard deviation within an ensemble of QSAR models. The current study applies such analysis to 30 QSAR models for the Ames mutagenicity data set that were previously reported within the 2009 QSAR challenge. We demonstrate that the DMs based on an ensemble (consensus) model provide systematically better performance than other DMs. The presented approach identifies 30-60% of compounds having an accuracy of prediction similar to the interlaboratory accuracy of the Ames test, which is estimated to be 90%. Thus, the in silico predictions can be used to halve the cost of experimental measurements by providing a similar prediction accuracy. The developed model has been made publicly available at http://ochem.eu/models/1 .

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Merging Applicability Domains for in Silico Assessment of Chemical Mutagenicity

Using a benchmark Ames mutagenicity data set, we evaluated the performance of molecular fingerprints as descriptors for developing quantitative structure-activity relationship (QSAR) models and defining applicability domains with two machine-learning methods: random forest (RF) and variable nearest neighbor (v-NN). The two methods focus on complementary aspects of chemical mutagenicity and use ...

متن کامل

Evaluation of (Q)SAR models for the prediction of mutagenicity potential

Developing alternative methods to in vivo testing is critical to the cosmetic industry based on ethical reasons, the REACh reglementation and the 7th Amendment of the European Directive on Cosmetics. A number of (Q)SAR models are commercially available, and building a strategy based on more than one such system is relevant considering the differences in models and applicability domains. The pre...

متن کامل

Ames Mutagenicity Assessment of Flavored Water Pipe Tobacco Products :A Cross Sectional Study in Tehran

Waterpipe smoking has become a global youth trend especially in the Middle East countries and Iran . The aim of this study was to determine the mutagenic effects of three most popular flavored tobaccos by four different salmonella typhimurium strains and compare the possible mutagenic effects of the test samples. Ames mutagenicity assessment was conducted according to the OECD guideline using T...

متن کامل

Robustified distance based fuzzy membership function for support vector machine classification

Fuzzification of support vector machine has been utilized to deal with outlier and noise problem. This importance is achieved, by the means of fuzzy membership function, which is generally built based on the distance of the points to the class centroid. The focus of this research is twofold. Firstly, by taking the advantage of robust statistics in the fuzzy SVM, more emphasis on reducing the im...

متن کامل

Evaluation of Mutagenicity of Mebudipine, a New Calcium Channel Blocker

Mebudipine is a new dihydropyridine calcium channel blocker, synthesized in our laboratory, for treatment of hypertension. It has shown a better efficacy than other drugs in this group. For assessing the risks of this drug, certain safety tests in the preclinical stage have been performed. In this study mutagenic effect of mebudipine was evaluated using Ames assay that could assess the mutageni...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and modeling

دوره 50 12  شماره 

صفحات  -

تاریخ انتشار 2010